Goto

Collaborating Authors

 fast prototyping and benchmarking


Tonic: A Deep Reinforcement Learning Library for Fast Prototyping and Benchmarking

#artificialintelligence

Distributed training has been shown to greatly accelerate the training of RL agents with respect to wall clock time (Mnih et al., 2016; Espeholt et al., 2018). Instead of interacting with a single environment at a time, the agent interacts with a set of differently seeded copies of the environment to diversify experience and increase throughput. For simplicity and to ensure reproducibility, Tonic uses a synchronous training loop illustrated in Figure 3.